Model Selection

Whisper fine-tuning

# Whisper fine-tuning

KinyaWhisper is a fine-tuned Kinyarwanda automatic speech recognition (ASR) system based on OpenAI's Whisper model, specifically designed for low-resource indigenous languages.

Speech Recognition

Transformers Other

Whisper Small Ta

This model is a speech recognition model fine-tuned on the Tamil Common Voice 17.0 dataset based on OpenAI's Whisper Small, with a Word Error Rate (WER) of 43.23%.

Speech Recognition

Transformers Other

Indian Accent English Whisper Finetuned

Fine-tuned the openai/whisper-large-v3-turbo based on the Indian English accent dataset, which is more suitable for speech recognition of Indian English accents.

Speech Recognition

Transformers English

Quran Whisper Base Fine Tune

This model is a fine-tuned Arabic speech recognition model based on openai/whisper-base on the quran-ayat-speech-to-text dataset, specializing in the task of converting Quranic verses from speech to text.

Speech Recognition

Transformers Arabic

Whisper Base Pl

A speech recognition model fine-tuned on the Polish Common Voice 17.0 dataset based on OpenAI Whisper-base

Speech Recognition

Transformers Other

Viwhisper Medium

Whisper-medium model optimized for Vietnamese speech recognition tasks, fine-tuned on 1308 hours of Vietnamese data

Speech Recognition

Transformers Other

Whisper Large V3 Cantonese

A Cantonese automatic speech recognition model fine-tuned on Whisper v3, trained on the Common Voice 17 dataset

Speech Recognition

Transformers Other

Akan Whisper Model

A fine-tuned version of OpenAI's Whisper model, specifically designed for automatic speech recognition tasks in the low-resource Ghanaian language Akan

Speech Recognition

Transformers Other

Whisper Small Khmer

A speech recognition model fine-tuned based on openai/whisper-small, specifically optimized for Khmer transcription accuracy

Speech Recognition

Transformers Other

Whisper Tiny Myanmar

This model is an automatic speech recognition (ASR) model fine-tuned on Burmese speech datasets based on openai/whisper-tiny, supporting Burmese speech-to-text tasks.

Speech Recognition

Transformers Other

Whisper Large V3 Myanmar

This model is an automatic speech recognition model fine-tuned on the Burmese speech dataset based on openai/whisper-large-v3, specifically designed for Burmese speech transcription.

Speech Recognition

Transformers Other

Monsoon Whisper Medium Gigaspeech2

Monsoon-Whisper-Medium-GigaSpeech2 is a Thai automatic speech recognition (ASR) model, based on Whisper-Medium and fine-tuned on the GigaSpeech2 dataset, suitable for speech recognition in real-world scenarios.

Speech Recognition

Akylai STT Small

Kyrgyz Whisper ASR is a customized automatic speech recognition solution specifically designed for the Kyrgyz language, fine-tuned based on the pre-trained Whisper model.

Speech Recognition

Transformers Other

the-cramer-project

Whisper Large V3 Taiwanese Hakka

A Whisper-large-v3 fine-tuned model for Taiwanese Hakka speech recognition, supporting multiple Hakka dialects

Speech Recognition

Transformers Other

Detect Language

A language identification model fine-tuned based on the Whisper Medium model, specifically designed for language classification tasks on the FLEURS dataset

Audio Classification

apparaomulpuriril

Whisper Sinhala Audio To Text

A Sinhala speech recognition model fine-tuned based on openai/whisper-small, supporting conversion of Sinhala speech to text.

Speech Recognition

Whisper Small Kyrgyz

Kyrgyz automatic speech recognition (ASR) model based on the Whisper architecture, developed with support from the National Commission on Language and Language Policy under the President of the Kyrgyz Republic

Speech Recognition

Transformers Other

Whisper Tiny Vi

Vietnamese automatic speech recognition (ASR) model fine-tuned based on OpenAI Whisper-tiny architecture, demonstrating excellent performance on multiple Vietnamese datasets

Speech Recognition

Transformers Other

Phowhisper Medium

PhoWhisper is a series of models designed specifically for Vietnamese automatic speech recognition (ASR). It achieves high robustness by fine-tuning the Whisper model on an 844-hour Vietnamese accent dataset.

Speech Recognition

Transformers Other

Phowhisper Small

PhoWhisper is a system specifically designed for Vietnamese automatic speech recognition, fine-tuned based on the Whisper model, supporting various Vietnamese accents.

Speech Recognition

Transformers Other

Phowhisper Large

PhoWhisper is a system specifically designed for Vietnamese automatic speech recognition, fine-tuned based on the Whisper model, supporting various Vietnamese accents.

Speech Recognition

Transformers Other

Whisper Small Fa

The Whisper (small) model fine-tuned by the Hezar team based on the Persian part of the Common Voice dataset, which can be used for automatic speech recognition tasks.

Speech Recognition Other

Whisper Large V2 Spanish

A speech recognition model fine-tuned on the Common Voice 13.0 Spanish dataset based on OpenAI Whisper-large-v2

Speech Recognition

Asr Whisper Medium Commonvoice Fa

A fine-tuned whisper medium model based on the CommonVoice-14.0 Persian dataset for Persian automatic speech recognition tasks.

Speech Recognition Other

This is a Bengali automatic speech recognition model based on the Whisper small architecture, fine-tuned on approximately 400 hours of Mozilla Common Voice dataset with a word error rate of 4.58%

Speech Recognition

bangla-speech-processing

Afrispeech Large A100

An African language speech recognition model fine-tuned from Whisper-large-v2, trained on the afrispeech-200 dataset with a word error rate (WER) of 14.81

Speech Recognition

Whisper Medium Arabic

An Arabic speech recognition model fine-tuned based on openai/whisper-medium, supporting streaming processing

Speech Recognition

Whisper Large V2 Spanish

Spanish speech recognition model fine-tuned based on openai/whisper-large-v2, achieving 8.55% WER on Common Voice 11.0 Spanish test set

Speech Recognition

Whisper Large V2 Kazakh

This model is a fine-tuned speech recognition model based on OpenAI's Whisper Large V2 on the Kazakh Common Voice 11.0 dataset

Speech Recognition

Transformers Other

Whisper Medium Portuguese

A Portuguese speech recognition model fine-tuned on the common_voice_11_0 dataset based on openai/whisper-medium, with a word error rate of 6.5987

Speech Recognition

Transformers Other

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase